Genetic algorithm as a variable selection procedure for the simulation of C nuclear magnetic resonance spectra of flavonoid derivatives using multiple linear regression
نویسندگان
چکیده
In order to accurately simulate C NMR spectra of hydroxy, polyhydroxy and methoxy substituted flavonoid a quantitative structure–property relationship (QSPR) model, relating atom-based calculated descriptors to C NMR chemical shifts (ppm, TMS = 0), is developed. A dataset consisting of 50 flavonoid derivatives was employed for the present analysis. A set of 417 topological, geometrical, and electronic descriptors representing various structural characteristics was calculated and separate multilinear QSPR models were developed between each carbon atom of flavonoid and the calculated descriptors. Genetic algorithm (GA) and multiple linear regression analysis (MLRA) were used to select the descriptors and to generate the correlation models. Analysis of the results revealed a correlation coefficient and root mean square error (RMSE) of 0.994 and 2.53 ppm, respectively, for the prediction set. 2008 Elsevier Inc. All rights reserved.
منابع مشابه
Genetic algorithm as a variable selection procedure for the simulation of 13C nuclear magnetic resonance spectra of flavonoid derivatives using multiple linear regression.
In order to accurately simulate (13)C NMR spectra of hydroxy, polyhydroxy and methoxy substituted flavonoid a quantitative structure-property relationship (QSPR) model, relating atom-based calculated descriptors to (13)C NMR chemical shifts (ppm, TMS=0), is developed. A dataset consisting of 50 flavonoid derivatives was employed for the present analysis. A set of 417 topological, geometrical, a...
متن کاملQSAR studies and application of genetic algorithm - multiple linear regressions in prediction of novel p2x7 receptor antagonists’ activity
Quantitative structure-activity relationship (QSAR) models were employed for prediction the activity of P2X7 receptor antagonists. A data set consisted of 50 purine derivatives was utilized in the model construction where 40 and 10 of these compounds were in the training and test sets respectively. A suitable group of calculated molecular descriptors was selected by employing stepwise multiple ...
متن کاملQuantitative structure-activity relationship (QSAR) study of CCR2b receptor inhibitors using SW-MLR and GA-MLR approaches
In this paper, the quantitative structure activity-relationship (QSAR) of the CCR2b receptor inhibitors was scrutinized. Firstly, the molecular descriptors were calculated using the Dragon package. Then, the stepwise multiple linear regressions (SW-MLR) and the genetic algorithm multiple linear regressions (GA-MLR) variable selection methods were subsequently employed to select and implement th...
متن کاملPenalized Bregman Divergence Estimation via Coordinate Descent
Variable selection via penalized estimation is appealing for dimension reduction. For penalized linear regression, Efron, et al. (2004) introduced the LARS algorithm. Recently, the coordinate descent (CD) algorithm was developed by Friedman, et al. (2007) for penalized linear regression and penalized logistic regression and was shown to gain computational superiority. This paper explores...
متن کاملImproving Brain Magnetic Resonance Image (MRI) Segmentation via a Novel Algorithm based on Genetic and Regional Growth
Background:Â Regarding the importance of right diagnosis in medical applications, various methods have been exploited for processing medical images solar. The method of segmentation is used to analyze anal to miscall structures in medical imaging.Objective:Â This study describes a new method for brain Magnetic Resonance Image (MRI) segmentation via a novel algorithm based on genetic and regiona...
متن کامل